Active metric learning for supervised classification

نویسندگان

چکیده

Abstract Clustering and classification critically rely on distance metrics that provide meaningful comparisons between data points. To this end, learning optimal functions from data, known as metric learning, aims to facilitate supervised classification, particularly in high-dimensional spaces where visualization is challenging or infeasible. In particular, the Mahalanobis default choice due simplicity interpretability a transformation of simple Euclidean using combination rotation scaling. work, we present several novel contributions both by way formulation well solution methods. Our approach motivated agglomerative clustering with certain modifications enable natural interpretation user-defined classes clusters metric. generalizes improves upon leading methods removing reliance pre-designated “target neighbors,” “triplets,” “similarity pairs.” Starting definition generalized has second order term, propose an objective function for selection does not aim isolate each other like most previous but tries distort space minimally aggregating co-class members into local clusters. Further, formulate problem mixed-integer optimization can be solved efficiently small/medium datasets approximated larger datasets. Another salient feature our method it facilitates active recommending precise regions sample improve performance. These are indicated boundary outlier points dataset defined This targeted acquisition significantly reduce computation ensuring training completeness, representativeness, economy, which could also advantages established Deep Learning Random Forests. We demonstrate computational performance through intuitive examples, followed results real image benchmark

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Active Metric Learning for Supervised Classification

Clustering and classification critically rely on distance metrics that provide meaningful comparisons between data points. We present mixedinteger optimization approaches to find optimal distance metrics that generalize the Mahalanobis metric extensively studied in the literature. Additionally, we generalize and improve upon leading methods by removing reliance on pre-designated “target neighbo...

متن کامل

Semi-supervised Distance Metric Learning for Visual Object Classification

This paper describes a semi-supervised distance metric learning algorithm which uses pairwise equivalence (similarity and dissimilarity) constraints to discover the desired groups within high-dimensional data. As opposed to the traditional full rank distance metric learning algorithms, the proposed method can learn nonsquare projection matrices that yield low rank distance metrics. This brings ...

متن کامل

Supervised Learning for Classification

Supervised local tangent space alignment is proposed for data classification in this paper. It is an extension of local tangent space alignment, for short, LTSA, from unsupervised to supervised learning. Supervised LTSA is a supervised dimension reduction method. It make use of the class membership of each data to be trained in the case of multiple classes, to improve the quality of classificat...

متن کامل

Semi-Supervised Active Learning for Sound Classification in Hybrid Learning Environments

Coping with scarcity of labeled data is a common problem in sound classification tasks. Approaches for classifying sounds are commonly based on supervised learning algorithms, which require labeled data which is often scarce and leads to models that do not generalize well. In this paper, we make an efficient combination of confidence-based Active Learning and Self-Training with the aim of minim...

متن کامل

Semi-Supervised Metric Learning Using Pairwise Constraints

Distance metric has an important role in many machine learning algorithms. Recently, metric learning for semi-supervised algorithms has received much attention. For semi-supervised clustering, usually a set of pairwise similarity and dissimilarity constraints is provided as supervisory information. Until now, various metric learning methods utilizing pairwise constraints have been proposed. The...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computers & Chemical Engineering

سال: 2021

ISSN: ['1873-4375', '0098-1354']

DOI: https://doi.org/10.1016/j.compchemeng.2020.107132